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Abstract —Multi-antenna or multiple-input multiple-output 
(MIMO) technique can significantly improve the efficiency of 
radio frequency (RF) signal enabled wireless energy transfer 
(WET). To fully exploit the energy beamforming gain at the 
energy transmitter (ET), the knowledge of channel state informa¬ 
tion (CSI) is essential, which, however, is difficult to be obtained 
in practice due to the energy and hardware limitation of the 
energy receiver (ER). To overcome this difficulty, under a point- 
to-point MIMO WET setup, this paper proposes a general design 
framework for a new type of channel learning method based 
on the ER’s energy measurement feedback. Specifically, the ER 
measures and encodes the harvested energy levels over different 
training intervals into bits, and sends them to the ET via a 
feedback link of limited rate. Based on the energy-level feedback, 
the ET adjusts transmit beamforming in subsequent training 
intervals and obtains refined estimates of the MIMO channel 
by leveraging the technique of analytic center cutting plane 
method (ACCPM) in convex optimization. Under this general 
design framework, we further propose two specific feedback 
schemes based on energy quantization and energy comparison, 
where the feedback bits at each interval are generated at the ER 
by quantizing the measured energy level at the current Interval 
and comparing it with those in previous intervals, respectively. 
Numerical results are provided to compare the performance of 
the two feedback schemes. It is shown that energy quantization 
performs better when the number of feedback bits per interval 
is large, while energy comparison is more effective vice versa. 

Index Terms —Wireless energy transfer (WET), energy beam- 
forming, channel learning, limited feedback, analytic center 
cutting plane method (ACCPM). 


I. Introduction 

Radio frequency (RF) signal enabled wireless energy trans¬ 
fer (WET) has become a promising technology to provide 
perpetual, cost-effective, and convenient energy supply to 
low-power wireless devices such as RF identification (RFID) 
tags and sensor nodes for various low-power applications in 
future smart commercial and industrial systems with wireless 
internet-of-things (loT) IT], |2l. In RF-based WET systems, 
dedicated energy transmitters (ETs) are deployed to control 
and coordinate the transfer of wireless energy via RE signals 
to a set of distributed energy receivers (ERs), and each ER 
uses the rectifier at each of its receive antennas (also known 
as rectennas as a whole) to convert the received RF signals 
to direct current (DC) signals, which are combined to charge 
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Fig. 1. A point-to-point MIMO WET system with closed-loop feedback. 

the battery at the ER |3 (see, e.g., a point-to-point WET 
system in Fig. [U. Examples of commercial ERs include the 
P2110 and P2210 Powerharvester Receivers developed by the 
Powercast company a. Due to the long operating range (say, 
tens of meters) and flexibility to charge multiple devices at 
the same time (thanks to the broadcast property of radio 
signal), RF-based far-held WET has competitive advantages 
over existing near-field WET techniques such as induction 
coupling, which generally operates with much shorter distance 
(say, centimeters) and for point-to-point power transfer only 
m . Furthermore, since RE signal can carry energy as well 
as modulated information at the same time, a joint investi¬ 
gation of wireless information and energy transfer is appeal¬ 
ing, which has recently drawn a great amount of attention. 
In particular, simultaneous wireless information and power 
transfer (SWIPT) (see, e.g., 15 ], 01 ) and wireless powered 
communication network (WPCN) (see, e.g., Q, 0) are two 
important operation models that have been thoroughly studied 
in literature. In SWIPT systems, wireless information and 
energy are transmitted using the same signal; while in WPCN, 
wireless information transmission is powered by WET using 
different signals. 

One major practical challenge for implementing RF-based 
WET is the significant decay of energy transfer efficiency of 
WET over the distance from the ET to the ER due to the 
severe prorogation power loss of RF signal. To improve the 
efficiency, multiple-input multiple-output (MIMO) techniques 
by equipping multiple antennas at the ET and/or the ER have 
been proposed as an appealing solution (see, e.g., 0, 0, 
0-ilo] and the references therein) 0 Multiple antennas at 
the ET help focus the transmitted wireless energy on the 
direction to the ER via the technique of digital beamforming 
or so-called energy beamforming, while multiple antennas 

* In practice, there is a limit on the number of antennas that can be deployed 
on a device of finite dimensions; however, with the prevalence of exploiting 
higher frequency bands (e.g., from several to tens of GHz) in future wireless 
systems, the antenna size will be significantly reduced and as a result it is 
more practically feasible to install multiple antennas on a single device. 
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at the ER increase the effective aperture area for received 
RF energy harvesting, both leading to significantly enhanced 
energy transfer efficiency. This helps extend the operating 
range of WET with given ERs’ energy requirements, and/or 
enable more power-consuming applications by increasing ERs’ 
harvested energy with fixed distances. However, the benefit of 
energy beamforming in MIMO WET critically relies on the 
availability of the channel state information (CSI) at the ET, 
which needs to be efficiently learned in practical systems. 

There are in general three types of channel acquisition 
methods that have been proposed in the literature for MIMO 
WET. The first method is by exploiting the channel reciprocity 
between the forward (from the ET to the ER) and reverse 
(from the ER to the ET) links inTl- lfT3]l . where the ET obtains 
the forward link CSI by performing a reverse link channel 
estimation based on the training signals sent by the ER. It 
is worth noting that this method is only applicable for time- 
division duplex (TDD) based systems, where the condition of 
channel reciprocity practically holds with good accuracy. 

The second method is via sending training signals directly 
in the forward link from the ET to the ER, through which 
the ER estimates the MIMO channel and then sends the 
estimated channel back to the ET via the reverse link m. 
This method is reminiscent of the conventional channel es¬ 
timation and feedback approach in wireless communication 
(see, e.g., na and the references therein), which is applicable 
for both TDD and frequency-division duplex (FDD) based 
systems. However, under the new setup of WET (instead of 
wireless communication), this method requires the ER (not an 
information receiver) to implement additional baseband signal 
processing for channel estimation, which is not feasible with 
the existing hardware at the ER designed only for RF energy 
harvesting (instead of information decoding) as shown in Fig. 

m 

To cater for the hardware limitation of existing ERs, in 
our previous work m we proposed a third channel learning 
method for MIMO WET based on the energy measurement 
feedback by the ER. Specifically, the ER measures its har¬ 
vested energy levels over different training intervals using an 
energy meter (see Fig. [TJ, and sends one feedback bit to the 
ET per interval to indicate the increase or decrease of the 
measured energy in the current interval as compared to that 
in the previous interval. Based on the feedback bits collected 
in the present and past intervals, the ET adjusts its transmit 
beamforming in subsequent training intervals and obtains 
refined estimates of the MIMO channel. Note that similar 
channel learning methods with one-bit feedback have also 
been studied in cognitive radio MIMO systems ifTTl . Ea. This 
energy feedback based channel learning method is applicable 
for both TDD and FDD systems, and can be implemented 
without additional baseband processing modules at the ER. 
However, the existing studies in Ea-nsi considered only one 
feedback bit per training interval; while it remains unknown 
how to efficiently implement this method for the general setup 
with multiple feedback bits available per interval. This thus 
motivates our study in this work. 

In this paper, we propose a general channel learning de¬ 
sign framework for MIMO WET based on the ER’s energy 


feedback over a finite-rate reverse link from the ER to the 
ET (thus including the one-bit feedback per interval designed 
in ca as a special case). Note that our results can also be 
similarly applied to other applications for channel learning in 
wireless communications with received signal power/energy 
based feedback (such as cognitive radio systems in ifTTll . ifTSll I. 
We consider a quasi-static channel model and a block-based 
energy transmission with each block spanning over a constant 
MIMO channel for WET. Each block is further divided into 
two phases for channel learning and energy transmission, 
respectively. In the channel learning phase, the ET learns the 
MIMO channel progressively by adjusting its training beam¬ 
forming weights according to the received energy feedback 
from the ER. It is assumed that at each feedback interval 
the ER sends B > 1 bits to the ET via the reverse link. In 
general, the feedback bits at each interval are generated by the 
ER via efficiently encoding the measured energy levels at the 
present and past intervals subject to the given feedback rate 
constraint. Based on the estimated MIMO channel from the 
channel learning phase, in the subsequent energy transmission 
phase, the ET implements the optimal transmit energy beam¬ 
forming to maximize the energy transferred to the ER. In this 
paper, we focus our study on the channel learning phase by 
investigating the energy feedback design at the ER and the 
training beamforming design and channel estimation method 
at the ET. Our main results are summarized as follows. 

« First, we present a general design framework for the 
energy measurement feedback based channel learning 
with limited feedback rate. Similar to ifTbll for the special 
case of one-bit feedback (i.e., B = 1), our design is 
based on the technique of analytic center cutting plane 
method (ACCPM) in convex optimization El. The basic 
idea of ACCPM is as follows: Based on the B feedback 
bits from the ER at each interval, the ET constructs 
one or more linear inequalities on the MIMO channel 
matrix to be estimated, where each inequality is used as 
a cutting plane for helping localize the MIMO channel 
in a multi-dimensional complex matrix space. Based on 
these cutting planes, the ET also designs the transmit 
beamforming for subsequent training intervals and ob¬ 
tains refined estimates of the MIMO channel. 

• Under this general framework, we then propose two 
specific feedback designs, namely, energy quantization 
and energy comparison, where the B feedback bits per 
interval are generated at the ER by quantizing the mea¬ 
sured energy level at the current interval and comparing it 
with those in the B previous intervals, respectively. Note 
that when B = 1, the energy comparison based channel 
learning scheme degenerates to that with one-bit feedback 
per interval as in our previous work ifTbl . 

• Finally, we provide extensive numerical results to com¬ 
pare the performance of our proposed feedback and 
channel learning schemes in terms of both mean-squared 
error of the estimated MIMO channel norm as well as 
the resulting energy beamforming gain. It is shown that 
energy quantization generally preforms better than energy 
comparison when B is large, while energy comparison is 


3 


more effective vice versa. 

The remainder of this paper is organized as follows. Section 
|II] introduces the system model and the two-phase transmission 
protocol. Section|III]presents the general design framework for 
the energy feedback based channel learning using ACCPM. 
Sections HVl and [V] present the energy quantization and energy 
comparison based feedback and channel learning schemes, 
respectively. Section |VT] presents a complexity reduction ap¬ 
proach for the proposed schemes by pruning irrelevant cutting 
planes in ACCPM. Section IVIII provides numerical results 
to evaluate the performance of proposed schemes. Finally, 
Section I VIII I concludes the paper. 

Notation: Boldface letters refer to vectors (lower case) or 
matrices (upper case). For a square matrix S, det(S') and 
tr(S') denote its determinant and trace, respectively, while 
S' A 0 means that S is positive semi-definite. For an arbitrary- 
size matrix M, ||iVT||F, rank(AT), , and denote 
the Frobenius norm, rank, conjugate transpose and transpose 
of M, respectively. I and 0 denote an identity matrix and 
an all-zero matrix, respectively, with appropriate dimensions. 
(^x^y denote the space of a; x y complex and real 

matrices, respectively. E(-) denotes the statistical expectation. 

11 a; 11 denotes the Euclidean norm of a complex vector x, and \z\ 
denotes the magnitude of a complex number z. \x\ denotes 
the largest integer not greater than the real number x, and 
[a;] denotes the smallest integer not less than x, respectively. 
Symbol j denotes the complex number 

II. System Model 

We consider a point-to-point MIMO system for WET as 
shown in Eig. [T] in which one ET with Mt > 1 transmit 
antennas sends wireless energy via RE signal to one ER with 
Mu > 1 receive antennaso We assume a quasi-static flat¬ 
fading channel model, in which the wireless channel remains 
constant over each block of our interest and may change from 
one block to another. Let the block length be denoted by T > 
0, which is assumed to be sufficiently long for typical WET 
applications with low mobility receivers. 

We consider transmit beamforming at the ET for WET. 
Without loss of generality, we assume that there are a total 
of d energy beams employed at the ET, where 1 < d < Mt is 
a design parameter that will be specified later. We denote the 
transmitted energy signal at the ET as a; = where 

Wi € ^ ^ denotes the Ah transmit beamforming vector and 

Si represents the corresponding energy-bearing signal. Since 
the energy signal s^’s do not carry any information, they are 
assumed to be independent pseudorandom sequences with zero 
mean and unit variance, i.e., E(|sip) = l,Vi G d}, 

in order to spread the signal power over the frequency to 
satisfy certain power spectral mask constraint imposed by the 
regulations on microwave radiation. Accordingly, the transmit 
covariance matrix at the ET is given by = E {xx^^ = 
Wiwf A 0. Note that for any given transmit covariance 

^Our framework is also applicable for distributed energy beamforming 
systems, where multiple distributed single-antenna ETs cooperatively send 
energy to one or more ERs provided that a central control unit (e.g., one of 
the ETs) is available to coordinate the transmission from multiple ETs by 
adopting the techniques proposed in this paper. 


matrix S, the corresponding energy beamforming vectors, i.e., 
Wi,..., Wd, can be obtained via performing the eigenvalue 
decomposition (EVD) on S, where d = rank(S'). In addition, 
we assume that the maximum transmit sum-power of the ET 
is given by P > 0 , and thus we have E(||ai|p) = tr {S) < P. 

With energy beamforming at the ET, the ER can harvest 
the energy carried by the d energy beams from all the Mu 
receive antennas. Let H £ ^MrxMt denote the MIMO 
channel matrix from the ET to the ER, and define G = 
H £ (.Q jjjg energy conservation, 

the harvested RE-band power from all receive antennas at the 
ER is proportional to o that of the equivalent baseband signal 
a, 0. As a result, the harvested energy at the ER in one 
particular block of interest is expressed as 

Q = r^TE{\\Hxf)=7^Tti-{GS), (1) 

where 0 < p < 1 denotes the energy harvesting efficiency at 
the ER. Since p is a constant, we normalize it as 77 = 1 in the 
sequel of this paper unless otherwise stated. 

A. Energy Transmission With and Without CSI 

In this subsection, we present the energy signal design 
with or without CSI at the ET. Eirst, consider that the ET 
perfectly knows the CSI. In this case, the ET designs the 
energy beamforming to maximize its transferred energy to 
the ER subject to the given transmit sum-power constraint. 
Mathematically, the transferred energy maximization problem 
is formulated as 


max Ttr (GS) 

S 

s.t. S' A 0, tr (S) < P. (2) 

Let the Mt eigenvalues of G be denoted by Ai > 
... > Xmt ^ ^nd the corresponding eigenvectors by 
Vi,... ,vmt^ respectively. Then it is shown in 0 that the 
optimal solution to problem (|2|i is given by S* = Pviv^, 
which achieves the maximum transferred energy to the ER 
given by 


Q* = TPXi. 


( 3 ) 


Here, it follows that tr(S*) = 1; as a result, only one single 
transmit energy beam is needed to achieve the optimality. 
Such a single-beam solution is termed as optimal energy 
beamforming (OEBj. 

On the other hand, consider the case when the ET does not 
have any knowledge about the CSI. In this case, the OEB is not 
implementable. Alternatively, the ET can employ an isotropic 
transmission to radiate its transmit power uniformly over all 
directions, with the corresponding transmit covariance matrix 
given by Siso = P/MtI. Correspondingly, the harvested 
energy at the ER is expressed as 


Qiso = Ttr{GSiso) = 


rPtr(G) 

Mt 


TPEfJi 

Mt 


( 4 ) 


By comparing Q* in 0 and Qiso in @, it is observed 
that the OEB with perfect CSI at the ET offers an energy 
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Fig. 2. The two-phase transmission protocol for MIMO WET. 


beamforming gain over the isotropic transmission without CSI, 
which can be expressed as 

^ Q* _ MtXi _ Mt\i 

Due to the fact that Ai > ... > \mt ^ 0, it is evident that 
1 < X* < Mt, where x* = 1 holds when Ai = • • • = \mt > 
0 (with rank(G) = Mt), while x* = -Afy is attained when 
Ai > 0 and A 2 = ■ ■ ■ = \mt = 0 (he., rank(G) = 1). 
For practical WET systems with Mt > Mu, we normally 
have Ai » A 2 > • • • > \mr > 0 (and Amb+i = • • • = 
Amt = 0 if Mt > Mji), since the wireless channel is often 
line-of-sight (LOS) dominant given the limited transmission 
range of most interest (say, several meters). In this case, an 
energy beamforming gain x* ~ Mt can be achieved with the 
availability of CSI at the ET. 


B. Two-Phase Transmission Protocol 

To achieve the promising energy beamforming gain in prac¬ 
tice, we consider a two-phase transmission protocol for the ET 
to learn the MIMO channel and based on the estimated channel 
implement the OEB. As shown in Eig. |2l each transmission 
block is divided into two phases, for the purposes of channel 
learning and energy transmission, respectively. We explain 
these two phases in detail as follows. 

Eirst, the channel learning phase corresponds to the first 
r amount of time in each block, 0 < t < T, which is 
further divided into > 1 training intervals. Let Tg denote 
the length of each training interval. Then we have r = NTg. 
In each of the N intervals, the ET transmits different sets of 
energy beams (each set corresponding to a different transmit 
covariance matrix) to the ER. Let the transmit covariance 
matrix at the ET in the interval n G N} be denoted 

by Sn, where h 0 and tr(5'„) < P. Then the harvested 
energy by the ER in the n-th interval is given by 

=TgtT{GSn),nG{l,...,N}. (6) 

Specifically, the ET implements energy measurement via the 
energy meter shown in Eig. [T]in the first Tm amount of time 


in each intervaljl and then sends back the measured energy 
information to the ET at the remaining Tf amount of time, 
where Tm + Tf = Tg. The harvested energy amount measured 
by the ER in the n-th interval is denoted by 

Q„ = r™tr(G5„),ne{l,...,A^}. (7) 

It is assumed that the ER can send B bits to the ET at each 
interval via the reverse link, where B = [RTf\ > 1 is an 
integer with R denoting the feedback rate from the ER to the 
ET. The value of B is practically finite since the ER is energy- 
constrained for sending training signals and the available 
bandwidth for feedback may be shared among multiple ERs 
(to their respective ETs). We denote the i?-bits feedback at 
each interval n as /„. In general, /„ specifies the energy 
feedback scheme in use at the ER which is a mapping from its 
present and past energy observations (i.e., Qi,..., Qn) to the 
B feedback bits at interval n,n G {1,2,..., N}. Eurthermore, 
let Em > 0 and Ef > 0 denote the energy consumed by the 
ER for the energy measurement and the information feedback 
in each interval, respectively. Note that the values of Tm and 
Tf should be chosen in practice by considering the sensitivity 
of the energy meter at each ER and the feedback link capacity 
from each ER to the ET, respectively, as well as the ER’s 
energy consumption for energy measurement and feedback. 

Eor the purpose of exposition, we make the following 
assumptions on the ER’s energy measurement and feedback. 

• The energy measurement of each Qn at the ER is per¬ 
fectly accurate, while the effect of energy measurement 
error at the ER on the channel learning performance will 
be shown by simulation in Section IVII-DI 

• The feedback link is ideal and each feedback /„ is 
received by the ET at the end of interval n without any 
error or delay; 

Based on the feedback {fn}n=i’ '^he ET obtains a final 
estimate of the MIMO channel at the end of the channel 
learning phase. The details of energy feedback design at the 
ER and training signal design and channel estimation method 
at the ET will be given later. 

Next, in the energy transmission phase with the remaining 
time T—t, the ET implements the OEB for WET based on the 
estimated MIMO channel from the previous channel learning 
phase. Let the dominant eigenvector of the estimated MIMO 
channel be denoted by Vi. Then the ET sets the transmit 
covariance matrix in the energy transmission phase as 

Set = Pfii-Of. (8) 

Accordingly, the transferred energy to the ER is expressed as 
QET = P(T-T)fifGfii. (9) 

By combing the channel learning and energy transmission 
phases, the net energy harvested by the ER over one whole 
block is expressed as its totally harvested energy minus the 

^Alternatively, the ER can use multiple energy meters each for measuring 
the harvested energy at one receive antenna. In this case, the MIMO channel 
learning becomes estimating an Mji number of multiple-input single-output 
(MISO) channels each for one receive antenna. This can increase the channel 
learning convergence rate at the cost of higher hardware and feedback costs 
at the ER. 
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total energy consumed for energy measurement and feedback, 
i.e., 

N 

Qnet — y] TMGSn) + P{T - t)v^Gvi -N{Em + Ef). 

n—1 

( 10 ) 


Accordingly, we define the practical energy beamforming gain 
achieved by the two-phase protocol as 


X = 


Q. 


Qu 


( 11 ) 


For typical WET applications with T being sufficiently large, 
it follows that t ^ T and TstT{GSn) ^ P{T — 

t)v^Gvi. Furthermore, it can be assumed that the energy 
consumption E^n and Ef are negligible as compared to 
the totally harvested energy, i.e., N^E^ + Ef) <C P(T — 
t)v^Gvi 0 Therefore, we approximate the practical energy 
beamforming gain as 


_ P(T -t)v^Gvi ^ ^ Mtv^Gvi 

^ Prtr(G)/M t ^ “ tr(G) 


( 12 ) 


For convenience, in the sequel of this paper we use the 
approximate energy beamforming gain x the performance 
metric for proposed feedback and channel learning schemes. 
Note that when the channel estimation is sufficiently accurate, 
i.e., Vi Ri Vi, the energy beamforming gain x approaches that 
by the OEB with perfect CSI at the FT x* in ©, i e., x ~ X*- 


III. A General Channel Fearning Design 
Framework Based On ACCPM 

In this section, we present a general framework for design¬ 
ing the energy feedback at the ER and the corresponding 
training signal and channel estimation at the FT based on 
ACCPM. ACCPM is well known as an efficient localization 
and cutting plane method for solving convex or quasi-convex 
optimization problems nsi, with the objective of finding one 
feasible point in a convex target set, where the target set can 
be the set of optimal solutions to the problem of interest. The 
implementation of ACCPM is based on the use of cutting 
planes, each of which is a hyperplane that separates some 
of the current points from the points in the target set. Based 
on the returned cutting planes in each iteration, the target set 
can be localized via a sequence of convex working sets with 
reduced size. 

To implement ACCPM in our context, we first define the 
target set for channel learning as follows. Note that it follows 
from ® that the implementation of OEB at the FT in the 
energy transmission phase only depends on bi, which is the 
estimate of the dominant eigenvector of the matrix G = 
H. For convenience, in the sequel we denote G as the 
MIMO channel to be estimated, instead of the actual MIMO 
channel H. To obtain Vi, it suffices for the FT to estimate 
any positively scaled matrix of G in the channel learning 

maximize the net energy hai'vested at the ER, there in general exists 
a design tradeoff in choosing r versus T — r between channel learning and 
energy transmission. Due to page limitation, we omit such discussion here 
and the interested readers can refer to [n-Ei, nsi for details. 


phase. Without loss of generality, we focus on learning the 
normalized MIMO channel given by G = G/tr(G) with 
tr(G) = 1. Accordingly, the target set of our interest is defined 
as = {G}. 

Next, we explain the principle for designing the feedback 
{/„}, and show that they in fact play the role of specifying 
the cutting planes in ACCPM for localizing ip. To facilitate 
the design of {/n}, we first set the transmit covariance 
matrix at the FT in the first interval n = 1 to be Si = 
P/MtI (i.e., isotropic transmission), based on which the 
corresponding harvested energy amount measured by the ER 
is Qi = Tmtr{GSi) = TmP/MTtT{G). By using Qi as a 
reference energy level, we then normalize the harvested energy 
measurement in each interval n G {1 ,...,W} as 


r, _Qn _ MTtr(GS„) 

Qi~ Ptr(G) 


A/t" 


tr (GS„) . 


(13) 


Note that for interval n = 1, we have Qi = tr(G) = 1; as 
a result, the feedback over this interval /i will not provide 
any information on G and thus can be ignored. This assump¬ 
tion is made specifically for our proposed channel learning 
algorithms, while in general it may not be optimal for other 
designs. Since Qn, n G {2,..., N} is a real number but /„ 
is of B bits only, each feedback /„ cannot completely convey 
Qn to the ET at interval n; therefore, /„ should be designed by 
properly encoding the present and past energy measurements, 
i.e., Q 2 , ■ ■ ■, Qn, at each interval n G {2,..., N}. The optimal 
encoding scheme subject to B feedback bits per interval is 
a difficult problem to solve; thus we will later propose two 
suboptimal designs for efficient practical implementation in 
Sections |IV] and [V] respectively. For each proposed feedback 
scheme, a set of new linear inequalities (or cutting planes 
in ACCPM) on G are obtained from /„ at each interval 
n G {2,..., N}, which are generally expressed as 


tr (Sn,cG) — Jn.c ^ 0, C G {1, . . . , Cn}, (14) 


where G„ > 1 denotes the number of inequalities/cutting 
planes obtained at interval n, and {S„_c} and {jn.c} 
parameters depending on the design of transmit covariance 
matrix 5'„’s and feedback /„’s in each scheme (to be specified 
later). Each of the G„ cutting planes in (fT4l i ensures that the 
MIMO channel G should lie in the half space of 

Pn.c = ^Sn,cG^ — Xn.c ^ oj- , 

cG{l,...,Cn},nG{2,...,N}. (15) 


Now, based on the returned cutting planes in (fT4l i. we 
are ready to obtain a sequence of working sets to localize 
the target set ip. Since G ^ 0 and tr(G) = 1 are a 
priori known by the ET, we have the initial working set as 
Vo — |g|G h 0,tr(G) = l|, with Vq 3 p. Furthermore, 
for the first interval n = 1, we have the working set Vi =Vo, 
since Qi = tr(G) = 1 is a constant and /i does not 
contain any information on G. Then, for the subsequent 
intervals n G {2, ...,V}, the working set can be updated 
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Fig. 3. Illustration of using neutral and deep cutting planes to update the 
working set at interval n G {2,..., A^} with ACCPM: (a) the working set 
Vn—i containing the MIMO channel G to be learned; (b) a neutral cutting 
plane that passes through Gn—i, which is the analytic center of Vn-i, to cut 
nearly a half space of Vn—i, with tr — 7^,1 = 0; (c) a deep 

cutting plane that locates between G and Gn-i to cut a more substantial 
sub-space of Vn—i, with tr — 7^7 > 0 . 


as Vn = Vn-i n 1-Ln,i *'' H 1-Ln,c,,. or more explicitly, 

'Pn = •|g'|G ^ 0, tr ^6^ = 1, tr — 7m,c ^ 0, 

Vce {2, (16) 

It is evident that Vq = Vi ^ ^ Vn 2 Therefore, 

the sequence of obtained working set P„’s shrink in size and 
eventually converge to (/J as TV —c». Here, the convergence of 
our proposed ACCPM based channel learning can be proved, 
similarly as that given in M for the special case of B = 1. 
For brevity, we omit the proof in this paper. 

Finally, to complete the ACCPM based channel learning, it 
remains to design the cutting planes in (fT4l i in terms of d 
and {7„,c}. c € {1,..., Cn}, for each interval n € {2,..., N} 
via adjusting the transmit covariance matrix Sn- Consider a 
feedback interval n with the given working set Vn-i as shown 
in Fig. [Ha). Then, based on the principle of ACCPM lfT9l , 
a desired cutting plane should be neutral or deep. A neutral 
cutting plane should pass through the so-called analytic center 
of Vn-i, denoted by G„_i, so that nearly a half portion of 
Vn-i can be “cut” or eliminated, as shown in Fig.|3b); while 
a deep cutting plane should locate between the analytic center 
Gn-i and the MIMO channel G to be learned, such that a 
more substantial sub-space of Vn-i can be cut as compared 
to a neural cutting plane, as shown in Fig.^c). Here, for each 
convex working set Vn, its analytic center is defined as 

Gn =arg min — log det (G) — 

GbO,tr(G) = l ^ ^ 

n Cm 

X] X (-tr (Sm.cC) + 7m.c) , n G {1,..., V}. 

m—2 c—1 

(17) 

Note that the minimization problem in dnii can be shown 
to be convex EQl, and thus is solvable by standard convex 
optimization techniques such as CVX ED. Also note that we 
have Gi = I/Mt from (fTTl i. As a result, in order for the cth 
cutting plane to be neutral or deep (see Fig. O, we design the 
transmit covariance matrix Sn to ensure 

tr (s„_cG„_i^ - 7„,c > 0, c G {1,..., Cn}- (18) 


TABLE I 

A Generic Algorithm for Channel Learning based on ACCPM 


Algorithm 1 

1) Initialization: Set n — Q and Si = P/MtI- 

2) Repeat: 

a) n ■<— n -|- 1; 

b) The ET transmits with S^; 

c) The ER determines (if n > 2) based on the normalized energy 
measurements, Qi, . . . , Qn in GD, and then sends f n ba ck to the ET; 

d) The ET extracts a set of Cn cutting planes given in 1141 based on fn (if 
n > 2), and computes the analytic center as in 1171 ; 

e) The ET designs Sn.+i to ensure tr(Sn+i cGn) — 7n-f i c > 0 for some 

cG {l,...,Cn}. 

3) UntU n> N. 

4) The ET obtains the channel estimate as G — Gn . 


Note that when G„ > 1, it may be difficult to guarantee all 
the Cn cutting planes to be neutral or deep. In such cases, 
we can select any one of them to be the neutral/deep cutting 
plane to satisfy (fTSI l. The detailed design of such Bn’s will 
be explained later for each specific feedback scheme to be 
introduced. 

In summary, a generic algorithm for channel learning based 
on ACCPM is described as Algorithm 1 in Table I. Note that 
at the last interval N, the ET adopts the analytic center Gn 
as the final estimate of the MIMO channel in the channel 
learning phase, given by G = Gn- Under the above general 
design framework with ACCPM, in the next two sections, we 
present two specific feedback schemes for channel learning, 
respectively. 

IV. Energy Quantization Based Eeedback and 
Channel Learning 

In this section, we propose an energy quantization based 
feedback scheme, where at each interval n the ER obtains the 
feedback /„ by quantizing its normalized energy measurement 
Qn at the current interval into B bits. Eor this scheme, we 
denote the feedback at the ER as '^he transmit covari¬ 
ance matrices at the ET as the updated sequence of 

analytic centers as |g|j and the estimated MIMO channel 


A. Feedback Design at ER 

Eirst, we present the feedback design for ’^he ER 

by quantizing the normalized energy measurements {On}- 
Among various quantization methods E2l . we adopt a sim¬ 
ple uniform scalar quantizer that is of low complexity for 
energy-efficient implementation at the ER. Consider an in¬ 
terval n G {2,...,V}, and assume that the normalized 
energy measurement Qn is within the unit interval (0,1]. 
Note that if an overflow (i.e., Qn > 1) occurs, then we 
set Qn = 1. To minimize the probability of overflow, we 
should properly set the transmit power at the ET as will be 
specified later (see (|25])). Then, the uniform quantizer maps 
Qn into a discrete variable that belongs to a finite set, given 
by { Y + = 0,..., 2® — l}, where there are in total 2® 

quantization levels resulted from the B feedback bits, and 
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A = 2 ^ denotes the quantization step size. As a result, the 
quantized value is expressed as 


Qn 


\2^Qn\ 

2B 


1 

2^’ 


(19) 


where \x\ denotes the minimum integer that is no smaller 
than X. Accordingly, by encoding the quantized value Qn as 
a B-bits digital codeword, the feedback /n^^ can be obtained, 
which is then sent back to the ET. 

After receiving the feedback fn \ n £ {2,..., N}, the ET 
has the knowledge of the quantized value Qn, and accordingly 
knows that the normalized energy measurement Qn should lie 
between two thresholds Qn — js+r and Qn + 2 ^+Tj i.e.@ 

Qn ~ 2_B+1 — Qn A Qn + 2B+1 ’ 

By using (l20l) together with (fTTl l. it follows that 

_^tr(G5W)+Qn-^<0, (21) 

^tr(G5W)-Qn-^<0. (22) 

The two inequalities in (ITTI i and (l22l i correspond to Gn = 2 
cutting planes in the general expression (fl4li to localize the 
MIMO channel G with /P, 7 „,i = -Qn + 

2B + 1 ^ '^n ,2 = ^T^n'’ / P, and 7„_2 = Qn + 2^ + 1 ■ 


B. Training Signal Design and Channel Estimation at ET 

Based on the feedback fn^'^’s, the ET obtains the working 
set for the interval n £ N} similarly as in (fThl l by 

replacing the cutting planes in (fTTl i as those in dTTl i and (l22l i. 
i.e.. 


TABLE II 

Energy Quantization Based Scheme 


Algorithm 2 

1) Initialization: Set n — 0 and = P/MtI- 

2) Repeat: 

a) n ■<— n + 1; 

b) The ET transmits with 

c) The ER determines (if n > 2) by encoding the quantized value Qn in 
GH as a S-bits digital codeword, and then sends back to the ET; 

d) The ET updates the working set as in |23| , and computes the analytic 

center of as given in |24| ; 

e) The ET generates given in |26| . 

3) UntU n> N. 

4) The ET obtains the channel estimate as 


(I 22 I 1 to be neutral or deep with respect to Towards this 

end, we design to ensure that 



where i corresponds to the threshold between the two con¬ 
secutive quantization levels i — and i + ^b+t- From 

dZSb , it follows that when Qn > ^ + 2^+1 ^ we have 

-^fti (^Gn-i^n^'j +Qn- 2 pT > 0 , and hence the cutting 
plane in (ITTT i is neutral or deep; otherwise (when Qn < 
1 - t^ft), we have ^tr -Qn- 5 ^ > 0 , and 

thus the cutting plane in (iTTl i is also neutral or deep. Thus, 
both cases satisfy the requirement given in (fTSl l for ACCPM. 
In particular, at each interval n £ {2,..., N}, we set the 
transmit covariance matrix as 

Sl^^=PnA^An, (26) 


iP« = |G|GA0,tr (g) =1, 

- ^tr (g5«) + Q„, - ^ < 0, 

^tr (gS-W) - - ^ < 0,Vm £ {2,... ,n}|. 

(23) 

Accordingly, the analytic center of is expressed as 

g|j^^ =arg min — logdet^G') 

G70,tr(G) = l ^ ^ 

- E log (^tr (gsW) - Qn. + 

- E log (O™ + pj+r - (gsS')) ^ 

m=2 ^ 2 

(24) 

Einally, to complete the energy quantization based scheme, 
we design the transmit covariance matrix at each interval 
n £ {2,..., N} to make one of the cutting planes in (ITTl i and 

^It is worth noting that if Qn = p+i, then the first inequality in 
becomes Qn > 0, which is known a priori and thus is redundant. On the 
other hand, if Qn = 1 — , then the second inequality in (20) becomes 

Qn < 1, which, however, may be unreliable due to the possibility of overflow. 
In the above two cases, the coiTesponding inequalities should be discai'ded. 


where An is an Mt x Mt complex matrix, each element 
of which follows a circularly symmetric complex Gaussian 
(CSCG) distribution with zero mean and unit variance, and to 
satisfy (IZSl l p„ > 0 is given by 


Fn- TTm d 

2MTtT[Gl\A^An) 

Here, the randomness of An in s!n'^ helps make the cutting 
planes over different feedback intervals as diverse as possible, 
so that they are more effective in cutting. Eurthermore, several 
random trials may be required to make sure that tr (s‘'>) < 

In summary, the energy quantization based feedback and 
channel learning scheme is described as Algorithm 2 in Table 

im 

C. Special Case: B ^ oo 

It is interesting to discuss the energy quantization based 
scheme in the extreme case of unlimited number of feedback 
bits per interval, i.e., B c». In this ideal case, we have 
2 J +1 0, and the two inequalities in dTTI) and (l22li can be 

combined into one single linear equality as 

^tr (g^W) - Q„ = 0, n £ {2,..., A}. (28) 




























As a consequence, the computation of the analytic center in 
(l24l i at the interval n G {1,..., n} can be re-expressed as 

=arg min—logdet fG) 

G ^ ^ 

s.t. ^^tr -Qm = 0,Vm S {2,... ,n} 

G^O,tr(G)=l. (29) 

To draw insights from problem ( |29] |, we define a vector op¬ 
eration cvec(-) as follows. For any complex Hermitian matrix 
X. G that contains independent real elements, the 

vector operation cvec(-) maps X to a real vector cvec(X) G 
z > 1, where all elements of cvec(X) are independent 
from each other, and tr(Xl^) = (cvec(X))^cvec(Y^) for any 
given complex Hermitian matrix yE Therefore, by defining 
g = cvec(G) G and = cvec(5^^^) G we 

can rewrite the n linear equality constraints in problem (l29l l 
as 

= <3m,Vm € 

(cvec(I))'^g = 1, (30) 


which correspond to a set of n linear equations with M|. real 
unknowns. Based on (l30l l. we discuss the solution G^ ^ in 
(|2^ by considering the following two cases with n > M|. 
and n < M^, respectively. 

• In the case with n > M|,, the number of linear equations 
in (l30l l is no smaller than that of the real unknowns. 
In this case, as long as the vector Sm^’s are linearly 
independent such that rank(iT„) = M|. with = 


cvec(/),s^^\ ...,; 


,(i) 


1 T 


^nxM^ 


, the solution of g 

to (l30l l is unique, and so is the optimal solution G^ ^ 
in (|29] | due to the one-to-one mapping of cvec(-). As 
a result, the obtained G^ ^ is indeed the exact MIMO 
channel G to be learned, i.e., G^ ^ = G. 

In the case of n < M|,, the set of equations in (l30l l corre¬ 
sponds to an underdetermined system of linear equations. 
In this case, the solution of g to (EOll is not unique in 
general, and so is the obtained g|j ^ in (l29T l. As a result, 
G^ ^ may not be the exact MIMO channel G, i.e., channel 
estimation error can occur. 


From the above discussion, it is evident that when B ^ oo, a 
minimum number of M|. feedback intervals are sufficient for 
the FT to obtain an exact estimate of the MIMO channel with 
the energy quantization based scheme. 


*The mapping between the complex Hermitian matrix X € and 

the real vector cvec(X) € z > 1, can be realized as follows. The 

first z elements of cvec(.X') consist of the diagonal elements of X (that are 
real), i.e., [X]aa’s, Va G {1,..., z}, where [X^ab denotes the element in 
the ath row and bth column of X; the next ^ elements of cvec(X) 
are composed of the (scaled) real part of the upper (or lower) off-diagonal 

elements of X, i.e., \/a^b G z}, a < 6; and the 

2 _ ^ 

last ^ 2 ^ elements of cvec(X) correspond to the (scaled) imaginary part 

fXl -.fXl 

of the lower off-diagonal elements of X, i.e., j- ——^’s, Va, 6 G 
{1,..., z}, a < b. 


V. Energy Comparison Based Feedback and 
Channel Learning 


In this section, we propose an alternative feedback design 
so-called energy comparison, where the B feedback bits per 
interval indicate the increase or decrease of the harvested 
energy at the ER in the current interval as compared to those 
in the previous B intervals, respectively. For this scheme, we 


denote the feedback at the ER as 


{/P}, 


the transmit covari¬ 


ance matrices at the ET as the updated sequence of 


analytic centers as 

^(2) 

as G . 


{cf}. 


and the estimated MIMO channel 


A. Feedback Design at ER 

In the energy comparison based scheme, the ER designs 
each of the B bits in separately. Consider a particular 
feedback interval n G {2,..., N} and let the 6th bit of 
be denoted by f^l, b G {l,...,i3}. Then we use f^l to 
indicate the increase or decrease of the normalized harvested 
energy Qn by the ER in the current interval n as compared 
to Qn-b in the 6th previous interval, 6 G {1,... ,B}. More 
explicitly, we have 


f(2) ^ 

n,b 


1, if Qn Qn—b 

1, if Qn ^ Qn—b 


(31) 


— ^2) 

Here, we assume Qn-b = 0 and = 0 for any n and 6 
with n — 6 < 0 for notational convenience. By combining the 

obtained sends it back to the ET. 

After receiving the feedback /„ , at each interval n G 
{2,..., Nj, the ET extracts the following min(i?, n—1) linear 
inequalities on the MIMO channel G based on (ImT i as well 
as (fT^FI 


/(')tr (G < 0, 6 G {1, ..., min(B, n - 1)}, 

(32) 


which correspond to G„ = min(B,n — 1) cutting planes 
in the general expression (fT4l i to localize the MIMO chan¬ 
nel with S„^b = 7n,f, = 0,6 G 

min(f3, n — 1)}. 


B. Training Signal Design and Channel Estimation at ET 

( 2 ) 

Based on the feedback /A s, the ET can obtain the working 
set for the interval n G A^} similarly as in (fT^ by 

^Note that for any 6 > n in i32\ , = —1 always holds due to “ 

0. In this case, the inequality due to = — 1 becomes tr > 0, 

which is trivial and does not provide any information on the MIMO channel 
G ^ 0 (due to y 0). Therefore, only with b < min(S, n — 1) 

are used for generating inequalities for channel learning. 
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replacing the cutting planes in (fT^ as those in < l32l t: 
=|G|GhO,tr (g) =1, 

/i>(G <0, 

V6 £ {1,..., min(m — 1, B)}, Vm £ {2,..., n}|. 

(33) 

(2) 

Accordingly, the analytic center of Vn is expressed as 

G^ ^ = arg min — log det (g) 

GtO,tr(G)=i ^ ^ 

n min(m —1,5) 

-E E i°i;(-/iX6(sS'-s™.))). 

m=2 b=l 

(34) 

Finally, we design the transmit covariance matrix for 
the interval n £ {2, to make the cutting planes in 

(l3^ neutral with respect to G„_i (thus, satisfying (fT^ for 
ACCPM), i.e., 

tr = 0,6£ {l,...,min(n- 1,B)}, 

(35) 

where the above equalities hold regardless of the sign of 

fnb- However, if min(n — 1,B) > 1, then it is infeasible 

to find such an to satisfy all these equations in (IlST i 
at the same time. To overcome this difficulty, we propose 
to only ensure the neutral cutting plane for 6 = 1, i.e., 
tr = 0. Towards this end, we set 

=S^^l, + An. (36) 

Here, A„ is a Hermitian probing matrix, which is neither 
positive nor negative semi-definite in general and satisfies 

tr(Gi"!iA„) =0,n = {2,...,7V}- (37) 

Accordingly, the min(n — 1,B) cutting planes in (l32l i can be 
re-expressed as 

H-- + A„_f,+ l)) < 0, 

6 £ {1,..., min(n — 1, B)}. (38) 

To find such a A„ to satisfy (IJTI i for the nth interval, 
we express g^li = cvec ^G„2i^ and (5„ = cvec (A„), 

(2)T ( " (2) \ 

where = tr(G„_ 2 A„j = 0. Due to the one-to- 

one mapping of cvec(-), finding A„ is equivalent to finding 
Sn that is orthogonal to g^li- Define a projection matrix 

Fn = I -express F„ = V„y„, 

where Vn £ satisfies Vn^n-i = ^ ^ttd 

VnVn = I- Thus, Sn Can be any vector in the subspace 
spanned by Vn- Specifically, we set 

(39) 


TABLE III 

Energy Comparison Based Scheme 


Algorithm 3 

1) Initialization: Set n — 0, and — Pf M^I- 

2) Repeat: 

a) n ■<— n + 1; 

b) The ET transmits with 

c) The ER sends back (if n > 2) to the ET with — —1 (or 

1) if Qn > Q-n — b (otherwise), b € {1, . . . , 5}; 

d) The ET updates the working set as using m), and computes the analytic 
center of as given in |34| ; 

e) The ET generates A^+i, and updates 

3) Until n>N. 

4) The ET estimates the MIMO channel as G^^^ — G^\ 


where p £ is a randomly generated CSCG vector 

with zero-mean and identity covariance matrix (generally 
scaled by a certain positive constant; see footnote 12) in order 
to make Sn independently drawn from the subspace. With 
the obtained Sn, we have A„ = cmat(<5„)0 where cmat(-) 
denotes the inverse operation of cvec(-). Accordingly, Sn that 
satisfies the equation in (l35l l for 6 = 1 is obtained. 

In summary, the energy comparison based feedback and 
channel learning scheme is described as Algorithm 3 in Table 

m 

C. Energy Quantization Versus Energy Comparison 

In this subsection, we compare the energy quantization ver¬ 
sus energy comparison based feedback and channel learning 
schemes via geometric interpretations on their cutting planes 
at each feedback interval n £ {2,...,Af}. For the ease of 
comparison, we assume that both schemes have the same 
working set Vn-i = the same analytic 

center G^^^i = G^n-i ^t interval n—1, as shown in Figs.|4]and 
0 for the cases of large B and small B values, respectively. 

First, we compare graphically the cutting planes returned by 
the two schemes. For the case of energy quantization, there 
are two cutting planes obtained at each interval n as specified 
in (I 2 TI 1 and (l22l i. It is observed in Figs. 01 a) and|5ja) that the 
two cutting planes are parallel to each other, since the gradient 
of the left-hand-side (LHS) in (|2TI) is just the opposite of that 
in (I 22 I 1 . It is also observed that only one of the two cutting 
planes is deep (e.g., see Fig. 0ta)) or neutral (e.g., see Fig. 
13 a)) with respect to the analytic center g|j2i of Vn-i, while 
the other is neither deep nor neutral. This is consistent with 
the training signal design in ( l25T l. 

In contrast, for the case of energy comparison, there are a 
total of min(n — 1,B) cutting planes in each interval n as 
specified in ( l38b . It is observed in Figs. 0tb) and|3b) that all 
these cutting planes pass through the common point of 0, while 
among them, only the neutral cutting plane corresponding to 
6 = 1 (i.e., /Str(GA„) < 0) passes through the analytic 

^Note that An in general contains both positive and negative eigenvalues. 
As a result, the update in a may not necessarily yield an Sn that satisfies 
both tr(5n) ^ P and Sn ^ 0. Nevertheless, by setting ||p|| to be sufficiently 
smaller than P, we can always find a p and its resulting Sn satisfying the 
above two conditions with only a few random trials. In this paper, we choose 

IIpII = P/^- 


^ nPf 





10 




Fig. 4. Energy quantization versus energy comparison based schemes: the 
case with large B. 



Fig. 5. Energy quantization versus energy comparison based schemes: the 
case with small B. 


~ (2) 

center G^_i of Vn-i- Furthermore, the non-neutral cutting 
plane with 6 = 2 (i.e., + A„_i)) < 0) is 

observed to locate between the neutral cutting plane in the 
current interval n with 6=1 (i.e., /^^^tr(GA„ ) < 0) 
and that in the previous interval n — 1 with 6 = 1 (i.e., 

/Q \ — 

,tr(GA„_i) < 0, which passes through the analytic 

~ ( 2 ) 

center G „_2 of Vn- 2 ), since the gradient of the LHS in the 
former inequality is a linear combination of those in the latter 

~ (2) 

two. Note that the point G „_2 always locates on the boundary 
of Vn-i, as a result, the cutting plane with 6 = 2 is very likely 
to cross the interior of Vn-i, making it efficiently eliminate a 
sizable portion of Vn-i- Similar properties also hold for other 
non-neutral cutting planes with 6 G {3,..., min(n — 1, B)}. 

Next, with the above geometric interpretations, we compare 
the effectiveness of the two feedback schemes for channel 
learning by considering three cases with different values of 
B. 

• First, consider the case of large B as shown in Fig. |4] 
For energy quantization, the gap between the two cutting 
planes is small (particularly, it is zero when B — c»), 
and as a result they can cut a significant portion of 


Vn-i- Therefore, energy quantization is more effective 
than energy comparison in this case. 

• Next, consider the case of small B (but with B > \) 
as depicted in Fig. |5] For energy quantization, the gap 
between the two cutting planes becomes significantly 
larger as compared to the case of large B in Fig. |4] 
As a result, the non-neutral cutting plane is likely to be 
less efficient or even redundant. In contrast, for energy 
comparison, the non-neutral cutting planes will still cross 
the interior of Vn-i, which helps cut the volume of Vn-i 
effectively even with a small B value. As a result, the 
cutting planes of energy comparison are generally more 
effective than those of energy quantization in this case. 

• Finally, consider the special case of i? = 1. In this 
case, both schemes can only return one valid cutting 
plane per interval. Intuitively, the two schemes should 
have similar channel learning performance; however, as 
will be shown by the numerical results in Section IVIII 
energy comparison is still more effective than energy 
quantization in this case. 


VI. Complexity Reduction Via Pruning Irrelevant 
Cutting Planes 


In the previous sections, we have proposed a general channel 
learning design framework based on ACCPM as well as 
two specific feedback schemes for the FT to learn the CSI, 
where at each feedback interval the FT needs to solve an 
optimization problem (see (Eb for obtaining the analytic 
center of the updated working set. As the number of feedback 
intervals n increases and that of added cutting planes becomes 
large, the complexity for solving such problems also increases 
significantly. To solve this issue, in this section we propose an 
efficient complexity reduction method by pruning irrelevant 
cutting planes at each feedback interval n after obtaining the 
analytic center of (i.e., by adding a pruning procedure 
between steps 2-d) and 2-e) in Algorithms 1-3). For brevity, we 
discuss for the general design framework in Section |III] only, 
while similar procedures can be applied for the two specific 
schemes in Sections |IV] and |V] 

The complexity reduction method is motivated by the 
example illustrated in Fig. |6] where various cutting planes 
returned at intervals 2,...,n establish a working set 
for localizing the target set (/? = {G}. It is observed that 
among these cutting planes, some of them are important 
active constraints to bound Vn (see the solid lines in Fig. |6]l, 
while the others are less relevant or even redundant (see the 
dashed and dotted lines in Fig. |6]l. Therefore, by pruning such 
irrelevant cutting planes, we could reduce the computation 
complexity for solving problem (fTTl i without degrading the 
channel learning performance. In the following, we first rank 
the irrelevance of the cutting planes. 

Consider a particular feedback interval n G {2,..., A^} and 
suppose that there are X]m =2 cutting planes at hand, which 
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Fig. 6. An example working set Vn (shown shaded) characterized by 
various cutting planes (depicted as lines). Here, the solid lines represent 
most important cutting planes for bounding Vn'. the dashed line shows a 
less important cutting plane (though still active for bounding Vn)', and the 
dotted line is a redundant cutting plane. 


form a polyhedron as 


■|G|tr(Sm^cG') — 7m,c < 0, 

VcG Cm}, VmG {2, (40) 


As compared to the working set Vn in ( fTht . we have omitted 
G ^ 0 and tr(G) = 1 in (|40] | to focus on studying the cutting 
planes only. For simplicity, we use G„ (i.e., the analytic center 
of Vn in (HSll) as the (approximate) analytic center of the 
polyhedron in (l40l i. since the constraints G ^ 0 and tr(G) = 
1 become less likely to be active when the number of cutting 
planes Gm becomes large. Define 


«'n(G) = 
n Cm 

(tr ^Sm.cG^ - 7m.c) 


Then, the ellipsoid 


-2 


cvec(Sm,c)(cvec(Sm,c))^- 
(41) 


= G 


|g (cvec(G-G„))^'3>„(G„) 
(cvec(G-G„)) < l| 

lies inside the polyhedron in (l40l i. while the ellipsoid 
Ut = |g (cvec(G- 


which is ^in scaled by a factor of X]m =2 Gm over its center, 
contains the polyhedron in (l40l i ll20l Chapter 8.5]. Therefore, 
the ellipsoid ^in at least grossly approximates the shape of the 
polyhedron in ( |40] |. and indicates the following irrelevance 
measure m 




tT{^m,cGn 


■ '^m,c 


(cvec(Sm,c))^’®'„yG„)cvec(Sm,c) 


(42) 


for the cutting plane tr(Sm,cG) — 7m,c < 0,c G 

Cm}, m G {2,..., n}. Here, larger pm.c means that 
the corresponding cutting plane is more likely to be irrelevant. 


Next, we prune irrelevant cutting planes based on the above 
irrelevance measure {pm,c}- We simply drop the cutting planes 
with largest values of Pm.c, keep the cutting planes 
with smaller values. In particular, the number of kept cutting 
planes is fixed as > 0, where Nc is a design parameter 
controlling the trade-off between the channel learning accuracy 
and computation complexity (as will be shown in more detail 
via numerical results in Section IVII-Cl l. With fixed Nc, the 
computational complexity per interval for solving problem 
© does not grow as n increases. 


VII. Numerical Results 


In this section, we provide numerical results to evaluate 
the performance of our proposed channel learning schemes. 
We consider a point-to-point MIMO WET system with one 
ER located at a distance of 5 meters from the ET, where 
the average path loss from the ET to the ER is 40 dB. For 
the considered short transmission distance, the LOS signal is 
dominant, and thus the Rician fading is used to model the 
channel from the ET to the ER. Specifically, we assume 

H = J + J ^ Jj-NLOS^ (43^ 

V 1 + Xfl V 1 + Xfi 

where G j-jjg pQg deterministic compo¬ 

nent, jjNLOS g ^MrxMt denotes the non-LOS Rayleigh 
fading component with each element being an independent 
CSCG random variable with zero mean and variance of 10“^ 
(to be consistent with the assumed average power attenuation 
of 40 dB), and XR the Rician factor set to be 5 dB. 
Eor the LOS component, we use the far-held uniform linear 
antenna array model with each row of expressed as 

10“^ [l ••• with 9 = where k 

is the spacing between two adjacent antenna elements at the 
ET, A is the carrier wavelength, and (f) is the direction of the 
ER from the ET. We set k = and (j) = 30°. Eurthermore, 
we set the number of transmit antennas at the ET as Mt = 4, 
the number of receive antennas at the ER as Mu = 2, and the 
maximum transmit sum-power as P = 30 dBm (1 Watt). 


A. Channel Learning Performance 


To start with, we compare the channel learning performance 
of the energy quantization and energy comparison based 
feedback schemes. Eigs. |7] and |8] plot the normalized error 


of estimated matrix norm, i.e.. 


G- G 


versus the number 


of feedback intervals N, under different numbers of feedback 
bits per interval, B. When B = \, it is observed in Eig. |7] 
that energy comparison achieves a lower normalized error than 
energy quantization. When i? > 1, it is observed in Eigs. |7] 
and|8]that energy quantization outperforms energy comparison 
when N is below a certain threshold (e.g., N < 30 for 
B = 2, and iV < 90 for i? = 10), while the opposite is true 
when N becomes larger than the threshold. This result can be 
explained as follows based on the geometric interpretations 
of their different cutting planes (for a given interval n) in 
Section IV-CI When the number of feedback intervals n is 
small, the working set Vn is large and energy quantization 
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Fig. 9. The two-phase transmission protocol for MIMO WET with random 
beamforming. 


Fig. 7. Normalized eri'or of estimated matrix norm versus the number of 
feedback intervals N with B = 1 and B = 2. 



Fig. 8. Normalized eri'or of estimated matrix norm versus the number of 
feedback intervals N with B = A, B = 10, and B oo. 


is more effective in space cutting with a fixed B; however, 
when n is large, becomes small and energy comparison is 
more effective due to multiple cutting planes with the same 
B value. Furthermore, when B oo, it is observed in Fig. 
0 that energy quantization obtains an exact estimate of the 
MIMO channel with the normalized error falling sharply to 
zero after N > = 16 intervals. This result is consistent 

with our discussion in Section IIV-CI 

B. Energy Beamforming Gain 

Next, we evaluate the energy beamforming performance 
with estimated MIMO channels by the two schemes, as 
compared to a reference scheme named random beamforming. 
Similar to our proposed ACCPM based channel learning, the 
random beamforming scheme implements a two-phase trans¬ 
mission protocol for channel learning and energy transmission, 
respectively, as shown in Fig.|9] However, the channel learning 
phase in this case is divided into Nn -f 1 intervals, with the 
length of Tm for the hrst Nn intervals and Tj~ for the last 
one. In the nth interval, n G {1,..., Nn}, the ET randomly 


Fig. 10. Energy beamforming gain versus the number of feedback intervals 
N with B = 1 and B = 2. 


same time, the ER measures the harvested energy Qn = 
Tmtr(G5j^). In the {Nr -\- l)th interval, the ER compares the 
measured energy amounts to obtain the maximum 

value, and sends its index n^ax = argniax„g{i 
back to the ET. Hence, the number of feedback bits in the 
{Nji + l)th interval is given by B^ = |’log 2 (iVfl)]. After 
receiving the feedback information, in the second energy trans¬ 
mission phase, the ET transmits with the energy beamforming 
vector in the rimaxth interval of the channel learning 

'‘•max ^ 



generates a beamforming vector with 
transmits with the covariance matrix = luj 


I = P and 
At the 


Fig. 11. Energy beamforming gain versus the number of feedback inteiwals 
N with B = A, B = 10, and B oo. 
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Fig. 12. Energy beamforming gain versus the number of feedback bits per 
interval B, under different values of N. 


phase. For a fair comparison, we set the length of the channel 
learning phase in the random beamforming scheme to be 
same as that in our proposed ACCPM based channel learning 
scheme, i.e., NuTm + Tj- = N{Tm + Tf). Furthermore, to 
obtain an accurate measurement result, the time consumed is 
in general larger than that for the feedback in each interval; 
hence, we assume Tm = 2Tf and Tf = in the simulation 
by ignoring the time difference in sending back different 
number of bits. Thus, we have Nr = 

Figs. [To] and [TT] depict the achieved energy beamforming 
gain X given by (ITST i in dB scale versus the number of feedback 
intervals N under different values of B. It is observed that 
when N is small (e.g., N < 30), energy comparison (energy 
quantization) achieves a higher energy beamforming gain in 
the case of S = 1 (S = 4 and B = 10), while the two 
schemes have a similar energy beamforming gain when B = 2. 
This result is consistent with the channel learning performance 
comparison in Figs. |7] and [8] In addition, when N becomes 
large (e.g., N > 40), both schemes with different values of 
B are observed to achieve energy beamforming gains close 
to the optimal one by OEB in the perfect CSI case (i.e., x* 
defined in (|5]l). Based on this observation together with those 
in Figs. |7] and |8| it is evident that a relatively rough channel 
estimation is sufficient to achieve a close-to-optimal energy 
beamforming gain for the MIMO WET system. Eurthermore, it 
is observed from Eigs.[TO|and[TT]that the random beamforming 
scheme performs much worse than our proposed ACCPM 
based channel learning schemes. Also note that unlike our 
proposed algorithms, the random beamforming scheme can 
only estimate the optimal beamforming vector, but cannot be 
used to estimate the exact MIMO channel. 

Eurthermore, it is interesting to consider a practical scenario 
where the number of feedback intervals N is kept small. Eig. 
[T 2 I shows the energy beamforming gain versus the number of 
feedback bits per interval B, with N = 10 and N = 15. 
It is observed that as B increases, the energy beamforming 
gain by energy quantization improves significantly, while that 
by energy comparison almost remains unchanged. This shows 
that energy quantization is more effective in the regime of 



Fig. 13. Normalized error of estimated matrix norm versus the number of 
feedback intervals N for energy quantization with cutting planes pruning. 

large B but limited N. 

To summarize, the above performance comparison provides 
useful insights on the selection between the energy quantiza¬ 
tion and energy comparison schemes in practice. Eor MIMO 
WET systems requiring relatively coarse channel estimation, 
energy quantization is more effective when the number of 
feedback bits per interval B is large, while energy compar¬ 
ison is more suitable when B is small. However, for other 
applications that require more accurate channel estimation 
(e.g., communication systems such as cognitive radio networks 
El, ESI), energy comparison could be more effective even 
under large values of B, provided that the number of feedback 
intervals N is sufficiently large. 

C. Performance with Cutting Planes Pruning 

Eurthermore, we evaluate the performance of the complexity 

reduction method via cutting planes pruning, in terms of 
both normalized error of estimated matrix norm and achieved 
energy beamforming gain. Due to space limitation, we only 
consider the energy quantization based scheme, while similar 
results have been observed for energy comparison and thus 
are omitted. We set B = 2 in this subsection. 

Fig ESjshows the normalized error of estimated matrix norm 
versus the number of feedback intervals N under different 
number of kept cutting planes Nc- It is observed that the cases 
with Nc = 2M|. and Nc = 3M|, achieve similar normalized 
errors of estimated matrix norm as that without pruning, while 
the case with Nc = results in noticeable performance 
degradation. 

Eig. [T4| depicts the energy beamforming gain versus N. It 
is observed that all the cases with Nc = M|., Nc = 2M^, 
and Nc = 3M|. achieve similar energy beamforming gains 
as compared to that without pruning. This demonstrates the 
effectiveness of the method of cutting planes pruning. 

D. Effect of Energy Measurement Errors 

In the above studies, we have assumed that the energy 
measurement at the ER is perfect and has no errors for the 
purpose of exposition. In the last subsection, we investigate 
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Fig. 14. Energy beamforming gain versus the number of feedback intervals 
N for energy quantization with cutting planes pnaning. 

the effect of energy measurement errors (e.g. due to the 
distortion induced by the energy harvesting circuits) on the 
channel learning performances by simulation. It is assumed 
that the measured energy amount at the ER in each interval 
n G N} is Qn = Qn + Vn, where Qn is the exact 

harvested energy given in (|7]i and denotes the error in the 
energy meter reading which follows a uniform distribution on 
the interval [—aQn, ctQn] with 0 < a < 1 being a constant. 

Consider the general design framework for channel learning 
in Section |III] Due to the energy measurement error, at each 
interval n the cutting planes in (fT4l i may not hold in general 
and thus it may occur that ^ (f. To resolve this issue, we 
modify the cutting planes in (fT4l i as 

tr(Syj 'Jn.c ^ ^n,c; C C Cn }; G {2, . . . , V}, 

(44) 

and accordingly obtain the relaxed working set at each interval 
n G {1,..., N} as 

~ '|d'|G 0, tr = 1, tr (^l^rn,cG^ — 'yrn,c < tm,c, 

VcG {1,... G {2,...,n}|, (45) 

where {tn,c > 0} are parameters designed for ensuring ^ 
ip with improved accuracy. Specifically, we set {fn.c} as the 
optimal solution of {fn,c} to the following convex optimization 
problem to minimize the distortion to the cutting planes. 

Cm 

min .EE 

{in,c},G rn—2c—l 

S.t. tr ^m,c ^ 

VcG {1,... G {2,...,n} 

G h 0,tr(G') = 1 

im.c > 0,Vc G {1,... G {2,... ,n}. 

(46) 

Then the analytic center of the modified working set V!^ is 



Fig. 15. Normalized error of estimated matrix norm versus the number of 
feedback intervals N under energy measurement errors. 



Fig. 16. Energy beamforming gain versus the number of feedback intervals 
N under energy measurement errors. 

given by 

G'„ =arg min — logdet(G) 

G'tO,tr(G) = l ^ ^ 

n Cm 

-EE log (^-tr m,cG^ 4“ ^rn,c 4“ tm,c^ ; 

m —2 c —1 

nG {l,...,iV}. (47) 

As a result, the modified generic algorithm can be obtained 
similarly as Algorithm 1 by replacing the cutting planes in 
di by those in (l45l l. and the analytic center by 

Note that the energy quantization and energy com¬ 
parison based channel learning schemes should be accordingly 
modified as above in the case with energy measurement errors. 

Fig. [15] shows the normalized error of estimated matrix 
norm versus the number of feedback intervals N subject to 
different values of energy measurement errors with B = 2. 
It is observed that for both channel learning schemes, the 
measurement errors with a = 0.001 and a = 0.01 result 
in error floors for channel estimation. In addition, when 
N > 60, energy comparison is observed to perform worse 
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than energy quantization in the case with a = 0.01, which 
is in sharp contrast to the case without energy measurement 
errors (i.e., a = 0), where the reverse is true. This shows that 
energy comparison is generally more sensitive to the energy 
measurement errors than energy quantization. 

Fig. [l6]shows the resulting energy beamforming gain versus 
N under different levels of energy measurement errors with 
B = 2. It is observed that with a = 0.001 and a = 0.01, 
the two channel learning schemes have similar energy beam¬ 
forming gains as that without energy measurement errors 
(i.e., a = 0), despite the error floor phenomenon observed 
for the normalized channel matrix in Fig. [15] This shows 
that in MIMO WET systems, our proposed channel learning 
schemes (with above robust design) are immune to the energy 
measurement errors. 

VIII. Conclusion 

This paper studied a new form of energy measurement based 
feedback for channel estimation in MIMO WET systems, by 
taking into account the energy and hardware limitations of 
practical ERs. We proposed a general channel learning design 
framework, by leveraging the technique of ACCPM in convex 
optimization. Under this framework, we developed two energy 
feedback schemes, namely energy quantization and energy 
comparison, for efficiently encoding the energy measurements 
at the ER with limited feedback. Simulation results showed 
that energy quantization is generally more effective when the 
number of feedback bits per interval is large, while energy 
comparison is preferable with small number of feedback bits. 
It is our hope that this paper provides new insights and 
will inspire future investigations on the practical design of 
channel learning for MIMO WET systems or other wireless 
communication systems based on the signal energy/power 
feedback. 
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